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I. INTRODUCTION 



Robert Head, in a recent article, made the amusing 
statement that "if we could imagine a present day Mark 
Twain somehow becoming envolved with computer technology, 
he might be expected to remark that "a management informa- 
tion system is rather like the weather: everybody dis- 

cusses it, but nobody does much about it” (8:4). 

Junior college administrators have been aware of the 
possibility of management information systems and have 
shown a great deal of interest in building common data 
banks to make operational, information systems of all 
^pes. Although the topic is widely discussed. Grant in 
Education and Data Processing reported that as of July 
1967, there was no school in the United States known to 
have operational a total information system (13:36). 

Only recently have schools done much about making 
information systems a reali^. A federal program called 
Projects to Advance Creativity in Education (PACE) , last 
year approved 14.6 million dollars in projects involving 
data processing systems (13:37). In addition, the National 
Science Foundation has established a new Office of Com- 
puting Activities to encourage the use of computers in 
educational applications. The hope of many school 
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districts is that, in the long run, the state can do some 
of the work for the districts; working on a common data 
base providing the department with needled statewide infor- 
mation in a single electronic language. 

The purpose of this paper to: (1) investigate the 

need for common data banks in California junior colleges, 
(2) determine the present development of information sys- 
tems lind data base design, and (3) determine the possible 
place of generalized file management systems in the devel- 
opment of junior college information systems. To further 
the writer’s understauiding of these software packages, 
forl^ hours of lecture were attended at the IBM Education 
Center in a class called "information Management System." 






II. NEED FOR A COMMON DATA BASE IN 
CALIFORNIA JUNIOR COLLEGES 

In 1967, the Danforth Foundation made a grant to 
sponsor a five-year program for the purpose of initiating, 
incorporating, and coordinating cooperative projects in 
Southern California junior colleges. Dr. Sheldon, co- 
Director of the UCLA Danforth Junior College Program, 
stated that early in the project it became evident that in 
order to implement cooperative projects , the same data was 
required from all participating colleges. Investigation 
of junior colleges in Southern California indicated that 
the collection and storing of similar data on their stu- 
dents, faculty, or college business was not a matter of 
routine (6: preface) . 

Many colleges fight the movement toward conmon data 
gathering because of the fear of losing institutional con- 
trol and individuality. Yet, if we analyse the amount of 
required data that is now gathered to answer state and 
federal questionnaires and reports, the fear of losing 
individual identity and control becomes, as Dr. Carhart 
suggests, an aca dem ic question (16:9). The pressures of 
growing enrollments, curricular modernisation, and shortage 
of funds no longer allow educational institutions the 
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luxury of trial and error procedures. Unfortunately ^ the 

economics of any one college completing all the programming 

it needs to automate data for research studies and for 

fiscal and student personnel procedures is prohibitive. 

The economic facts of life are pressing all of us to 
search for economical means to solve our problems. 

In order to cooperate with other colleges, common 
use of the same data collection systems and defini- 
tions is required. . . . The amount of time, money, 
and energy that can be saved is enormous if we can 
be assured that coiBpleted studies have a common 
data base. ( 16 : 11 ) 

Recently, there have been several projects funded 
which attenqpt to experiment with the cooperative sharing 
of computer resources as well as the cooperative sharing of 
collected data. An early project described in educational 
literature is the Suburban School Services Joint Board's 
"Total Information for Educational Systems." This is a 
Minnesota $450 thousand project in which educational data 
will be automatically generated and supplied to the state 
department of education ( 13 : 37 ). The "Area IX Total Infor- 
maUon System" is a $307 thousand project in Scott County, 
Iowa. The objective of creating a multi-county, multi- 
district information retrieval project is typical of the 
several similar projects being funded today ( 13 : 38 ). 

The planning conmittee for the Southern California 
Junior Collage Advisory Council recently studied the fea- 
sibility of a common data bank and a common data process- 
ing system for the California Junior College System. They 



made the follcwlng liqportant recommendations: 

!• A system based on the establishment of regional 
centers formed and operated by a group of coop- 
erating junior colleges. A optimum size for a 
regional center woiild permit it to support 5 to 
10 collages (25,000 to 100,000 students); 8 to 
12 such centers would be required for the state 
(6:3-2) 

Advantages s 

- Increased system capabilities such as a 
larger data base; greater flexibility, 
reliability, and over all performance. 

- Increased standardization. 

- Lower costs for equipment operation and 
maintenance. 

- Comparative information available among 
colleges. (6:3-1) 

2. Functional application, sequencing priorities, 
and schedules for iiiq)lementation. (6:5-2) 

This study also pointed out that some junior col- 
leges would more quickly have the need and the capacity to 
implement the suggested functional applications. It was 
the recommendation of this study that in the near future 
certain junior colleges act as pilot systems (6:5-1). 

When such a commitment is made at the state level for a 
pilot system, an important decision will have to be made 
as to the software used to de&ign, manage, and utilize 
that data base. The use of generalized file management 
system software is a possible approach to data base design. 



III. AN APPROACH TO DATA BASE DESIGN: 



NANAGSNENT INFORMATION OR 
INFORMATION MANAGEMENT? 

Existing files have historically been structured 
for individual storage within core and within secondary 
storage. Because the structure is for individual applica- 
tions and because common keys are then the only xneans of 
cross referencing iMO files, there tends to be a great deal 
of redundancy. Besides the storage costs for the duplica- 
tion of information in a variety of files, there is the 
increased possibility of duplicate data items not being 
updated on a certain file (10:26). 

Program maintenance is hampered by the present file 
structure, in that any change to the file format necessi- 
tates a diange to the appli cations program. In addition 
the programmer must deal with the entire record even 

though he might be working with only a small segment. This 
increases the complexity of the program, uses more core, 
and generally raises programming costs (10i37). 

Junior college administrative systems are required 
to collect certain defined data in order to answer state 
and federal questionnaires. This being the case, the 
defined data base itself is quite similar in junior 
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colleges throughout the state. Considerable work has 
already been done by groups r such as the Southern Calif or*" 
nia Junior College Advisory Council, to determine future 
research, administration, and educational data needs. 

Junior colleges in California have, to a varying degree, 
alrea^ structured files containing data for future needs. 
!Ehe problem lies in the fact that the structure and the 
maintenance of that data base differs widely among these 
jiinlor colleges in California. 

For this reason an immediate objective might be an 
"information management" system for California junior col- 
leges. Such a system might provide a means of creating 
and maintaining an information base from present disjointed 
file structures in an economical and efficient manner. 

It is the writer's observation that what most 
businesses today call a "management information" system, 
is really a method of achieving economical methods of 
collecting, transporting, processing, and displaying infor- 
mation (information management) . 

This information is not necessarily related to the 
organization and production of information to sup- 
port management decision making ... in fact much 
of the information being handled by today's auto- 
matic systems is valueless for this purpose. 

( 10 : 41 ) 

Not that all systems experts agree that the re- 
quirements of a policy-making function must be included in 
the development of management information systems. 
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Data Language- 1« It also contains a data oominunl cation 
facility which includes telecommunications, message sched- 
uling, checkpoints for the restructuring of data from the 
dieckpoints. 

The characteristics of Data Language-1 are the 
following: 

1. The ability to create, maintain, and expand the 
data base* 

2* The ability to handle variable length informa- 
tion* 

3* Provides the means to make the data and program 
independent* 

4* The ability to handle batch and online process- 
ing concurrently or independently* (14:6) 

Data Language-1 must be used to define the logical 
data structure of the data base* This is done external to 
the q^plications programs* It is then the responsibility 
of the applications programmer to provide common source 
programs linkage so that Data Language-1 sub-programs can 
be "called" to process input/output requests during execu- 
tion of the qpplicatioiic programs* 

The iq>plication programmer must write in one of 
three programming languages: COBOL, PLl, or ALC* This 

writer found the data baise language DLl to be quite similar 
to 360 Operating System Job Control Language* Although 
the language is essentially a macro language, it definitely 
is not a language easily mastered by non-programmers. In 
addition to mastering Data Language-1, the user must also 
have a knowledge of an applications language such as COBOL 
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3. In providing the me£ms to segregate the data 
base it should also permit guaranteed accuracy 
of updating by an automatic maintenance feature 
for all segments of the file. 

4. It is mandatory that the system permit variable 
length records so that students with many en- 
tries ... [or academic qualifications ^ etc.] 
would consume more file space than new students 
with few academic qualifications. 

5. It should permit ease of expanding or reducing 
the file to add both new entries such as new 
students and new areas such as additional (aca- 
demic] qualificatdLons for any given individual. 
(10:27) 

The Information Management System (IMS) announced 
by IBM is what the writer would call an "information man- 
agement" system. It is a data management system which 
segments an individual information system by accessing the 
data base outside of the application program. Additions 
can then be made to the file structure without necessitat- 
ing changes to the hundreds of programs that refer to the 
data base in general. The processing of a data base or- 
ganized by such an "information management" system has the 
additional following advantages: 

1. Elimination of redundant data on file. 

.2. Reduction of file maintenance. 

3. Reduction of errors due to updating. 

4. Consistency of information throughout the files. 

5. Reduction of data processing costs. 

6. Reduction of programming maintenance in that 
the application program is independent of the 
file organization. 

7. Less complex program logic r in that most of the 
1,0. functions are performed by the generalized 
software system. (10:28) 

In addition to software packages that manage data, 
the junior colleges should also consider the recently 
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develqped "generalized management information system" soft- 
ware packages. These generalized systems are variously 
referred to as "file management systems,” "management in- 
formation systems," and "information languages" (19:24). 
Essentially they are task-oriented languages rather than 
coiqputer oriented languages. One writer's opinion was that 
at least eighty percent of the applications encountered in 
data processing could be implemented without any formal 
programming being required if what is referred to in this 
paper as a "generalized file management system" (GFMS) were 
used (19:24). 

Like a "data management" system, a GFMS also pro- 
vides the primary functiwis of file creation, and file 
maintenance. In addition, however, provision is made for 
information retrieval and report generation tasks that can 
be handled by users rather than high level programmers. 

Many software houses have undertaken the development of 
GFMS software, urged on by the shortage of qualified pro- 
grammers, the more coin>lex architecture of third generation 
machines, and the lengthy in^lemention periods for rela- 
tively straight-forward applicatioxis. 

Many software executives feel that a GFMS package 
is a of circumventing the programming bottleneck 

faced by most con?>uter users (19:22). The development of 
these new software packages allow data processing managers 
to respond to executives who want to obtain data on demand. 
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Historically this could not be easily or inexpensively done 
with fixed programs* Even though these new 6FMS sofb^are 
pacdcages are expensive to develop, because many customers 
can use them the cost of development is spread over many 
users* The price to anyone individual user then becomes 
quite reasonable (15:31) * 

These generalized systems are developed with the 
knowledge of the inqportance of the man-’madtiine interface* 
For this reason these new programs axe often made easier 
for unskilled operators to use by causing the system to re- 
port back if an incorrect cosmand is encountered* This 
interface between man and program becomes critical when 
the GFMS is to be a real time system* It is this writer's 
opinion that the man-machine interface will be and should 
be one of the most critical factors in selecting a GFMS 
software package* 

The feasibility study for a common data bank pre- 
sented by the California Junior College Association recom- 
mends the use of general software packages and makes note 
of the following possible advantages: 

1* i^pplication8 on older conputers can be converted 
for use on newer models faster, and generally 
more economically than by reprogram min g * 

2* Packages are better debugged and more widely 
field tested when delivered than are most in- 
house programs* 

3* Documentation is generally superior to that de- 
velqped by a user's own programmers, thus 
facilitating conversion and future changes. 

4 * Package suppliers usually continue to maintain 
the package against changed requirements* 
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5. Availabili^ o£ package suppliers perscnmel who 
are generally more versatile and highly skilled 
than those of the user. (6:3-4) 

The next section of this paper inve-^tigates the 
characteristics of a "data management" system and then 
several "generalized file management systems." Until only 
recently, this type of software was supplied and supported 
:oy the conqputer manufacturer at no extra charge for the 
customer. For this reasm many customers have experi- 
mented with the manufacturer -3 generalized software, and 
therefore provided a pool of industrial esqperience for the 
uninitiated user. For this reason, the manufacturer's 
software will be analysed in depth. Advantages as well as 
disadvantages and shortcomings of the manufacturer's gen- 
eralized software packages will be noted for California 
junior colleges. 

The number of commercial GFMS software packages 
competing with the manufacturer's software is growing at 
an incredible rate and will grow at an even greater rate 
within the next couple of years. The author will atten^t 
to note the area where conqpetitive commercial systems hope 
to in^rove on the manufacturer's generalized software. 
Based upon what has been done in the past and what is 
claimed for the future by GFMS software packages, the 
author will attempt to specify some realistic selection 
criteria for a junior college system. 
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IV. GENERMiIZED FILE MANAGEMENT SYSTEMS 
SUPPORTED BY COMPUTER MANUFACTURERS 



Inff>fni>ition Management System (IMS) 

This system is best described by noting that it 
Qianages data more than it provides data to management. 

IBM describes it as a processing program designed to fa- 
cilitate the implementation of medium to large common data 
bases in a multi- implication environment. Both online 
message processing and conventional batch processing are 
separately or concurrently possible. A suggested strong 
point of the system is that it permits the evolutionary 
expansion of data processing applications from a batch 
only to a high volume information request teleprocessing 
environment (14:1). The objectives set for the system are 
the following: 

1. To provide data organization methods that are 
conducive to the creation and maintenance of 
large common data bases and the multiapplication 
use of these data bases. 

2. To provide the means to permit the user to fa- 
cilitate development and maintenance of a data 
base system in the batch processing environment. 

3. To provide the user with the ability to extend 
his data base processing to the teleprocessing 
or data communication environment. 

4. To provide the user with an efficient tele- 
communication ability for developing a high 
volume/ripid response online system. (14:4) 

IMS/360 is comprised of a data base facility called 
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Data Language- 1« It also contains a data communication 
facility which Includes telecommunications ^ message sched- 
uling, checkpoints for the restructuring of data from the 
checkpoints • 

The characteristics of Data Language-1 are the 
following: 

1. The ability to create, maintain, and expand the 
data base. 

2. The ability to handle variable length informa- 
tion. 

3. Provides the means to make the data and program 
Independent. 

4. The ability to handle batch and online process- 
ing concurrently or Independently. (14:d) 

Data Language- 1 must be used to define the logical 
data structure of the data base. This is done external to 
the applications programs. It is then the responsibility 
of the applications programmer to provide common source 
programs linkage so that Data Language-1 sub-programs can 
be "called” to process input/output requests during execu- 
tion of the qpplicatibiic programs* 

The application programmer must write In one of 
three programming languages: COBOL, PLl, or ALC. This 

writer found the data base language DLl tc be quite similar 
to 360 Operating System Job Control Language. Although 
the language Is essentially a macro language, it definitely 
is not a language easily mastered by non-programmers. In 
addition to mastering Data Language-1, the user must also 
have a knowledge of an applications language sudi as COBOL 
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to access the data base and retrieve Information. 

The data base itself is composed of data base 
records — a data base record being a collection (a variable 
numoer) of hierarchically related, fixed length data ele- 
ments, called segments. A "root" segment is the hipest 
hierarchical segment in the data base record. The writer 
found the “root" segment to be similar to the FD level 
indicator in the COBOL programming language. A dependent 
segment is a data base segment that relies on at least the 
root segioent for its full hierarchical meaning, therefore 
always being at a lower hierfurchical level than the root 
segment. The writer feels that a dependent segment is 
quite similar to the 02 level indicator in COBOL. There 
can be 255 segment types within a data base and fifteen 
levels of segment hierarchy. Although there is a maximum 
of 255 segment t^pes or names, a single segment type can 
occur any nuinber of times and this does not seem to be a 
major limitation* 

IMS/360 allows the user to attach information to a 
data base and makes it impossible for others to use or 
access the information* This characteristic is called 
"sensitivity" to one's own data* The writer finds this to 
be an adequate method of providing security for a data 
base* 

Minimiam system requirements are 12 8K bytes for 
batch-only processing and 256K bytes for teleprocessing 
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with 1I4S. In the later case the systenir for all practical 
purposes, would be dedicated to online IMS/360 applications 
without the concurrent running of batch- type jobs. In 
effect then, to use the capabilities of IMS/360 a System/ 
360 Processing Unit Model 1 (512K) with Multiplexor Chan- 
nel, 10 tape units, two 2314 Direct Access Devices would 
be a very typical system (14:31). The need for such a 
large machine configuration causes a severe limitation on 
the usefulness of IMS/360 for the California Junior Col- 
lege System. The writer knows of only one junior college 
in California that has equipment or plans to have equipment 
large enough to handle the IMS/360 software package. 

If California junior colleges adopt something like 
regional data processing centers, the availability of an 
adequate machine configuration for use is more feasible. 

The writer’s experience with IMS/360 found it to be 
truely a general purpose system which could be sq;>plied to 
any education organization as well as any company system. 
This chauracteristic is not unique to IMS/360. Neither is 
the fact that applications programs 2 uce given independence 
from the physical organization of the data. However, the 
method of handling variable length data, providing an ade- 
quate security system for data bases, and providing exten- 
sive checkpoint and restart facilities is an unusual 
characteristic for generalized file management systems. 

The system designers seem to have been quite 
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concerned about penoitting an evolutionary expansion of 
data processing applications from the batch environment to 
the teleprocessing environment. Because of this concern 
much statistical information is automatically collected to 
assist the user in evaluation of performance and changing 
communication requirements. 

This system became available from IBM in the summer 
of 1969 at a rental cost of $400 a month. This system 
could possibly be utilized by a regional group of jianior 
college with a large enough machine configuration or at a 
central state facility. 

Gener€dJ.zed Information System (GIS) 

As a system IBM's GIS provides facilities for de- 
fining ^ maintaining f and retrieving data from user files 
under the direction of the using installation (11s 1). GIS 
is purported to be designed for managers who need specific 
information as well as answers to ”what-if” type of ques- 
tions. Under GIS the con^uter is to be able to draw upon 
interrelated data from different information files in order 
to develop answers (4:23) . 

GIS functions are initiated and controlled by data- 
base-oriented macro instructions that constitute the 
problem language. Although this is purported to be a near- 
English language, many users feel that the ease of manage- 
ment useage only takes place if that management has had 
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actual programming experience (17:43). Nevertheless ^ the 
GIS language is made up of descriptive, procedural, and 
control commands whidh can work on files created by ALC, 

PLl, or CX)BOL programs. GIS "descriptive” statements ex- 
plain the design of each logical file in the data base and 
are processed to form a permanently stored control table 
that contains all logical physical characteristics of a 
file. A file is thus described only once, and its use in 
any subsequent task is accomplished by means of syntoolic 
identifiers. Procedural statements indicate the action to 
be taken and the conditions under which they are to be 
performed. A coherent set of procediiral statements con- 
stitute a task specification. Three types of task specifi- 
cations are possible: 

1. File creation — to establish a file for the first 
time. 

2. File update— to enter new data or delete or 
change data already in a file, and 

3. File inquiry — to locate, organize, summarize, 
and present data. (2:97) 

Statements in the control category establish oper- 
ating controls that mc^ modify or regulate the way in which 
other tasks and processing jobs are performed. Control 
statexoents may cause the following: 

1. A task specification to be stored in its origi- 
nal or executable form for later or repeated 
use* 

2. A saved task to be executed at a specified time 
or during a specified interval. 

3. A sequence of displays and con^>uter actions to 
be stored for recall as part of a structured 
xnan/roachine conversation from remote terminal 
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devices, or as elements of similar cne-and- 
response procedures. (2:98) 

In addition to standard Boolean operators, GIS per- 
mits four other conditioneil operators not commonly found in 
generalized file systeioss BETWEEN, SCAN, MASK, and CHANGE . 
The operator B£TVf£El!i determines whether a field value is 
between the two values given. The operator SCAN tests a 
field for a particular pattern of characters. The operator 
MASK tests a field for a match on certain characters while 
at the same time allowing other characters to take on any 
value. Finally, the CHANGE operator allows a test for a 
change in contents of a field from one record to the next. 

The GIS record structure is made up of logical 
records which may be stored sis a single physical record or 
as a physically split record. In the single physical 
record's case, the files are organized either as sequential 
or indexed sequential. The FILE KEY provides the field 
upon whidi an order is imposed. In the split-record situa- 
tion, the master segment is ordered according to the KEY; 
the trsdler segments have no order and sure instead bound 
to the master segment by links and chains. A link provides 
the means of finding the master segment from a trailer 
segment. Chains link master to trsdler, trailer to 
trailer and eventually the last of a sequence of trailers 
back to the master. 

GIS *8 Data Description Tables (DDT) are permanent 
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tables which relate syitibolic filCp field, and segitent 
names to physical characteristics in the systera. The for- 
mat and content of the DDT as well as the characteristics 



at record and sub-record level 
available is presented below. 

64K 



Eecord Size 2K 

Number of Cross 

Indexes ^ 

Segments 10 

Nui^er of Files 

Processible 2 (5) 

Fields 100 



according to core memory 



12 8K 256K 



4K 8K 

10 20 

30 100 

3 (6) 4-6 (12) 

250 500 

(17:36) 



Many system experts consider GIS to be a comprehen- 
sive generalized system of significant utility. However, 
the usefulness of the l 2 uiguage has been questioned because 
of slippages over the last three years as far as full im- 
plementation of the system. Poor performance, considering 
the speed of the equipment, due to its interpretive mode of 
execution has been a major drawback (19:26). 

As far as implementation of GIS on a regional 
jxmior college system, the realistic minimum machine con- 
figuration size of 12 8K again iTiight be economically re- 
stricting. This writer will look with great interest into 
GIS (Basic) which is supposed to require a smaller machine 
configuration size of 65K. The writer found no company 
presently fully inplementing GIS (Basic) • 



Information Oriented Language (INFOL) 

INFOL is a tape-oriented system developed by Control 
Data Corporation for its 3600 and 3800 series of oonputers* 
Implemented in assembly language, it runs under the SCOPE 
operating system. 

The type of user language is free form using field 
numbers for data identification. Both fixed and variable 
record formats are accommodated. The operating environ- 
ments is batch-only with no support for online processing. 
INFOL is a tape system, thereby emphasizing sequential 
processing. The minimum machine configuration has a 
reasonable core size of 65K with five tape drives. Unfor- 
tunately, CDC has indicated that no future releases of the 
system will be made and the existing version will not be 
sv^ported (19:26). 

The characteristic capabilities of INFOL are similar 
to those of other task-oriented systems and are presented 
below: 

1. The system can be used by persons who have 
little or no knowledge of programming tech- 
niques . 

2. Conditions on fields in records include the 
relatioral criteria, "equal," "greater than," 
"less than," etc. 

3. Output may be defined by the user or left to 
the system. 

4. Validation criteria may be in^osed upon input. 

5. File revision, necessary after extensive up- 
dating (with additions and deletions especial- 
ly) , can be handled with relative ease. 

6. Simple confutations. Including averaging, can 
be effected through simple system commands. 



7. Extensive diagnostic messages (about 250) pre- 
vent a variety of xniscues. (17:13) 

One of the unique aspects of INFOL is its approach 
to formatting* INPOL stores information in a frcse-field 
fashion, therefore, needing no specification as to ^pe 
and number of characters in a field« INFOL, therefore, 
can process both fixed and variable length fields because 
they are handled precisely the same. 

The INFOL record itself is not hiereurchical in form. 
There is no restriction on the type of file that can be 
handled since it is possible to associate fields and nest 
the association* Because the data must be reformated be- 
fore the data base can be accessed, INFOL might be more 
closely compared to IMS/360 than to GIS. Because of the 
need for reformatting (called ESTABLISHMENT by CDC) , some 
system experts disqualify INFOL as a "Generalized" file 
ag<;»m^n t system on the same grounds as they disqualify 
IMS/360. However, this writer finds it important to note 
that INFOL qualifies quite nicely as a generalized file 
management system when on ease of customer usage; 

especially when compared to IMS/360. 

The writer finds INPOL to be a system of significant 
value and recommends that a regional junior college system 
considers its use if CDC equipment is involved. Unfor- 
tunately INFOL, like IMS/360 and GIS, is not readily 
adaptable to another manufacture's machines. Certainly this 
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factor is a big disadvantage of manufacturer supported 
generalized file management software packages and a main 
advantage of some commercial systems* Once designing a 
junior college Management Information System around a sys- 
tem like lMS/360 or IKFOL, change to another computer 
manufacturer presently becomes very costly and time con- 
summing. 

INFOLr CIS, and lMS/360 were the three manufacturer 
systems studied in depth, 2 uialysing users manuals or 
attending the manufacturer's school. Three other systems 
are mentioned in the literature and are now briefly dis- 
cussed. 

Business EDP Systems Technique (BEST) 

BEST is an integrated system of fifty -five data 
processing functions which were designed to circumvent the 
programming of at least 50 percent of the explications en- 
countered by users of NCR's 315 computer. The type of 
language is tabulcu:, the record formats are fixed, and the 
processing access method is sequential (19:26). 

BEST operates on a relatively small system (NCR 315 
and 20,000 character positions) with five tape drives. 

Like previously discussed manufacturer software, BEST is 
written in a language unique to the manufacturer (NEAT) . 
This of course makes BEST very difficult to adopt for 
another system even if the m 2 mufacturer would release the 
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software (19:27). 

BEST has been praised for its ease of customer 
usa^e. Although definitely a modest systero^ it has been 
able to surpass the goal of hamdling 50 percent of a com- 
pany's typical applications. On the minus side^ users re- 
port that the con?>iling of a BEST program is slow. This 
is explained by the fact that BEST generates a NEAT assem- 
bly language program that must also then be assembled 
(19:27) . 

Tn-fcegrated Data Store (IDS) 

IPS is one of the first atten^ts by a manufacturer 
to develop a supported generalized program to organize, 
store, maintain, and retrieve information. IDS is not an 
information storage euid retrieval system of and by itself, 
but is instead vised with COBOL to provide logical record- 
processing capabilities. IDS resembles IMS/360 in that it 
is not an inexperienced-user type of language; instead re- 
quiring a programmer's knowledge of IDS and COBOL 
(20:3-275). 

IDS language provides data description elements 
which declare the existence of master detail relationships 
between records. These relationships are inplexnented 
through chjdn link processing techniques which create 
circular or ring structures quite like IBM's CIS package. 
The master record links to the first detail record; the 
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first: detail record links to the second detail record; and 
so on unti 1 the last detail record in the chain ^ which 
links back to the master record. Records may optionally 
contain the address of the prior record and/or master 
record (20:3-231). 

IDS operates in either a batch or direct access 
mode and has the ability to store and retrieve records of 
any length within a mass memory. The records themself are 
defined by COBOL data descriptions to contain a specified 
number or fixed size fields and to participate in a speci- 
fied number of chains (20:3-234). 

The writer finds IDS to be similar to IMS/360 but 
tied more closely to the compiler language. It seems that 
IBM borrowed sever 2 J. concepts from this early GE system 
and then increased the flexibility of their cwn system by 
separating the data division entirely from the compiler 
language. For a COBOL shop using GE equipment IDS is a 
worthwhile system and should be considered. 

MANAGE 

MANAGE is a Scientific Data System develc^ed gener- 
alized file management system. It has been successfully 
used in wide variety of applications since early 1967. 
manage has particularly been praised as a flexible general 
purpose system which can be in^>leinented for small computer 
configurations. The core requirements for the SDS 900 
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Series is 8K (24 Bit words) or 16K for the SDS 9300 Series. 
Three tape drives, card reader, and line printer suce also 
reqtiired (19:26)* 

The language used by MMAGE is the SOS BUSINESS 
LANGUAGE which is tabular in form. SDS claims that this 
language is a madiine independent assembly language; a 
distinct advantage over the previously mentioned systems 
(19:26). 
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V. C(»9CLUSI0NS AND RECOMMENDATIONS 



One author writing about generalized file management 

systems made the following very true statement: 

As a general rule (and not too surprising) it is 
safe to say that what is on the drawing boards is 
somewhat si:^rior to what has been developed, 

(17:1) 

The writer feels that the above statement represents 
the crux of the problem regarding the use of generalized 
file systems* Many of the manufacturer systems 

require a large machine configuration which the individual 
junior college just cannot afford, especially if it is a 
small junior college. This is one of the reasons that the 
concept of a c ommo n central data bank and regional coooputer 
centers makes such good sense. Yet, it is important to 
note that critics of manufacturer supported software claim 
a lack of flexibility and/or inefficient performance of 
some generalized file management systems (19:22). Plans 
for more efficient softwauce systems as well as for smaller 
software systems are on the drawing boards of both con^uter 
manufacturers and commercial software houses; and offer a 

potential future solution. 

The writer finds it quite disturbing that many sys- 
tems tie the user into that manufacturer's system or at 
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least into the saioe family of systems. This of course is 
disturbing but not surprising. Because the computer at a 
junior college is so widely used for the two year conputer 
vocational program, many junior college teaching staffs 
perfer to offer the student training on IBM 360 equipment. 
Because the same equipment is often also used for adminis** 
trative functions, many administrative data processing 
<^partinents rely on software supported by IBM. Ihe writer 
suspects that IBM's GIS/BaSIC will be further refined to 
the point where it becomes more efficient as well as 
feasible for a small 360/30. However, an administrative 
system tied into GIS/BASIC will be difficult to redesign 
when there is a change to a new manufacturer or to a higher 
family of computers. 

Although IMS/360 is available only for very large 
360 *s, the writer suspects that a type of "IMS/BASIC" will 
be developed for the t^pe of smaller system that might be 
found in an individual junior college or regional junior 
college conputer systmsss. The concept of managing a data 
base outside of the application program seems to be a 
sound idea. IMS/360 or any data management system, would 
hopefully provide sufficient flexibility to move across 
manufacturer or conputer family lines. 

The writer is quite hopefull that software houses 
will fill in the gap left by manufacturer supplied GFKS 
packages for small computer users. These systems will most 
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likely be both more flexible and more efficient; so as to 
create a profit motive over a wide rauige of customers* 

Until there are a greater number of generalised file man- 
agement system alternatives, what steps can a junior col- 
lege system take to utilize present software packages in 
the creation and use of data base systems? 

For junior colleges to band together and create a 
meaningful common data base, coordination at the state 
level will be required to point out which hardware systems 
will be compatible as well as which software systems will 
be compatible* Companies adopting generalized file manage- 
ment systems have suggested that previous users are often 
the best source of information on limitations, errors, and 
other problems associated with using a software package 
(3:20)* It heus also been found that an organization should 
keep the software as near the vendors "off-the-shelf" ver- 
sion as possible to take advantage of the maintenance that 
the vendor provides as well as to in^rove the chances of 
getting proper assistance (3:21)* Discussions with present 
and past users of generalized file management system are a 
source of information that the individual junior college 
should utilize* The emphasis is made regarding the in- 
volvement of the individual junior college, because the 
writer feels that the support for state level coordination 
must first be desired by the individual junior colleges* 

Prior to the selection of an appropriate software 
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gys 'tliQ jyinirOir coH©5© wxXl Ixav© to b© invoXvGd, Xn tb© 

Xarg© task of designing and redesigning an ^propriate 

caniaon data base. It is hard enough to coXXect and save 

data which has been produced for some defined purpose, but 

what about data needed for future unpredictable purposes? 

Considerable additional study still must be done to de- 

tenoine what should be required in a common data base for a 

California Junior College System. Even after this is done 

a considerable selling job will have to be undertaken to 

acquire sufficient support and sufficient funds. Hayes in 

a recent article points out the need for justifications 

. . . whereas the original recording of the data had 
clearly defined value, related to the purpose of its 
generation, the utility for other, unknwn uses is 
not all clear. . . . Since this [collection of data] 
is a "capital investment," it must be amortized, in 
some sense, over these future, unknown uses made of 
it. As a result the costs of an information re- 
trieval system are difficult to justify. (7s 26) 

Even before the need for justification of a data 
beise system, the problem of junior college administrator's 
reluctance toward a data base system will have to be over- 
come. This will be true at both the state and individual 
junior college level; and is certainly not unique to the 
junior college environment. Corporations who have under- 
taU*r.en a program of building a data base to enable an infor- 
mation system to function, have found the following) factors 
to be a source of confusion and lack of understanding. 

1. Associates in other companies either praising or 
condemning the results of similar programs 
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undGrtak6n by tholr cooip®nies« This contradic- 
tion bocomes more confusing when whe reasons 
given for success or failure differ from person 
to person. 

2. Internal line and staff management personnel 
stating the absolute necessity for# or violent 
abjection to sudi an undertaking with each en- 
dorsement or objection supported with valid 
proof. 

3. Presentations by experts in the information pro- 
cessing field pointing out the simplicity or 
extreme con^lexity of such an undertaking. 

4. The sudden appearance of excessive costs# during 
or following installation# that were not con- 
sidered at the time of the original estimates. 

5. Underlying fear of personnel that installation 
of a systcun will uncover weak spots in the or- 
ganization that could lead to changes affecting 
them. (1:10) 

Finally# it is the writer’s opinion that the junior 
colleges can very profitably determine if their own en- 
vironment is such that the design# installation# and c<m- 
tinued growth of an information system is probable. Many 
systems experts feel that it is best to delay the »tart of 
cosqpany involvement in a complex automated information sys- 
tem until the following items are in place: 

1. Adequate corporate discipline so that ccmmon 
interfunctional procedures can be implemented. 

2. Documentation of potential savings anticipated 
from installation. 

3. A relatively stable management# especially at 
the policy-making level. 

4. Manaaement that is willing to commit its own 
tSe and interest to understand the various 
plans, techniques and equipment associated witt 
the prc^osed system. This understanding should 
be in siifficient detail to enable intelligent 
monitoring of the costs and progress of the 
system. 

5. Tlie willingness of management to start acquiring 
and training a core of experienced systems per- 
sonnel. 

6. The presence within the organization of 
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operational personnel who are knowledgeable in 
depth concerning the information requirements, 
methods, procedures and techniques within the 
functions they are associated with. (1:12) 

When the junior college has satisfied these reauion- 
able prerequisites related to people, the largest step 
tow surd a common data bank has been taken. Once this cam- 
mitment has been made to install a common junior college 
data system, differing means of entering that data 

base must be measured against sene criteria. The writer's 
investigations of con 5 >uter-manufacturer-supplied General- 
ized File Management Systems suggests several broad cata- 
gories of criteria: file independence, low problem 

definition, strong user orientation, and consistency of 
file processing. All computer manufacturers and all com- 
mercial software houses claim to offer a GFMS that meets 
at least these criteria to some degree. The problem is to 
determine that degree. 

The disadvantages connected with generalized sys- 
tems supplied by the manufacturer are often rationalized 
away by an absence of a purchase price or perhaps a very 
low purchase or lease price. This is often the claim of 
the enthusiastic salesman from a commercial software house. 
The truth is that there are additional is^lementation 
costs to be considered for all generalized file management 
systems whatever the source: 

1. Training costs 



2 . 



Manpower time 

3. Machine resources 

4. Costs of distributing and maintaining program 
documentation • 

It seems apparent, however, that what a commercial 
software house might offer is more efficient use of machine 
resources and a more efficient use of user time in return 
for a purchase or lease price. Certain factors are notice- 
ably variable among many of the generalized file management 
systems : 

1. Ease of integration with other software systems. 

2. The current status of the software package (e.g. 
testing or production stage) • 

3. Modification constraints. 

4. Number of existing installations. 

It seems to this writer that two of these factors are 
especially vital criteria when considering use in a junior 
college system. Because of a desire for a degree of 
autonon^ among individual junior colleges or among regional 
junior college systems, it is vital that a GFMS be compat- 
ible with a great variety of softwaure systems and modifica- 
tion constraints are flexible enough to adapt to various 
administrative systems. 

Many gcuieralized file management systems vary as to 
file and record definition. Because various file struc- 
tures must be accommodated in a junior college system, the 



following are recommended as criteria in selection of a 
GFMS. 

1. The ability to accommodate hierarchical rela- 
tionships in raw data* 

2. The ability to overlay descriptions on existing 
files* 

3. The ability to accommodate fixed, variable, and 
undefined record formats. 

Most generalized file management systems claim to be 
designed for the convenience of non-programmers. In com- 
paring specific systenus the following criteria should be 
considered to enable easy use by non- programmers * 

1. Shorthand default options* 

2. The facility for synonym searching, 

3. The cataloging of requests for future use. 

4. The ability to print successively a nunl)er of 
reports in ^e peiss throu^ the file* 

5. The existence of extensive diagnostics. 

Many marketable generalized file management systems place 
a heavy emphasis on their report-generating capabilities. 
There are certain criteria that can be used in judging this 
capabilil^: 

1. Flexibility in choice of output medium. 

2. Flexibility in format. 

3. Automatic security messages when data defined as 
confidential is retrieved. 
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The writer has earlier mentioned processing efficiency as 
one of the strongest points of a marketable GFMS for jvaiior 
colleges. What then are the criteria for measuring this 
efficiency of processing? 

1. The ability to effectively operate in a rela*- 
tively small system environment. 

2. The ability to include tcisk programs written in 
a procedural language. 

3. The ability to constantly check for data valid- 
ity. 

4. Data integrity. 

5. The ability to retrieve data from multiple- 
files during a single run. 

Historically, customer support has proven to be an impor^ 
tant factor in effectively installing a GFMS. The follow- 
ing criteria should be used as indicators of sufficient 
customer support. 

1. Sufficient documentation available on the sys- 
tem. 

2. Training requirements for users adequately 
planned for by the vendor. 

3. Service response on problems meets the user’s 
requirements . 

4. Maintenance support is included in the price of 
the package. 

In summary, the design of a common junior college 
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data base is first required before a Generalized File Man- 
agement System can be considered. A central state data 
base titystem requires a user oriented method of accessing 
the data base as well as an efficient method of managing 
the data. As this writer sees the problem; the computer 
manufacturer makes available a system to efficiently manage 
the data base, but does not offer a system taiilored to the 
non-technical user* The solution might be to efficiently 
manage the data base with provided software while evaluat- 
ing commercial Generalized File Management Systems. 

The writer has indicated what he considers to be 
important criteria in evaluating processing efficiency, 
factors of file and record definitions, convenience of use 
for non-programmers, report generating capabilities, and 
customer support. When the common data base system heus 
been designed, representatives from the individual jiinior 
colleges or from regional computer centers should be polled 
to include other possible criteria. The expense of evalu- 
ating the growing number of Generalized File Management 
Systems could be absorbed once by a central state agency. 
The purchase cost of such a system should be absorbed at 
the state level and the generalized software package made 
available to members of the junior college network. This 
idea of evaluating Generalized File Management Systems can 
be undertaken by regional data processing centers but with- 
out the economy of scale possible at the state level* It 
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is quite possible that more than one Generalized File 
Management System would be required to meet certain cri-- 
teria. Only at the state level can the purcheise or lease 
of more than one of these generalized systems be an eco~ 
n(md.c reality. 

The realistic sequence of activities must then be 
the creation of a common junior college data base, the 
creation of a central information network, criteria speci- 
fied for a Generalized File M2uiagement System; and finally, 
availability of this generalized system to access, re- 
trieve, and update the data base itself. 
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